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(A) APPLICATION NUMBER: 07/615,784 

(B) FILING DATE: 14-NOV-1990 

(A) APPLICATION NUMBER: 07/567,373 

(B) FILING DATE: 13-AUG-1990 

(A) APPLICATION NUMBER: 07/494,106 

(B) FILING DATE: 16 -MAR- 19 90 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Lassen, Elizabeth 

(B) REGISTRATION NUMBER: 31,845 

(A) NAME: Donna E. Scherer 

(B) REGISTRATION NUMBER: 34,719 

(C) REFERENCE /DOCKET NUMBER: CGNE 69-4 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (916) 753-6313 

(B) TELEFAX: (916) 753-1510 

(C) TELEX: 350370 CGNE 

(2) INFORMATION FOR SEQ ID NO : 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Ala Ser Thr Leu Gly Ser Ser Thr Pro Lys Val Asp Asn Ala Lys Lys 
15 10 15 

Pro Phe Gin Pro Pro Arg Glu Val His Val Gin Val Thr His Xaa Met 
20 25 30 
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Pro Pro Gin Lys lie Glu lie Phe Lys Ser lie Glu Gly Xaa Ala Glu 

35 40 45 

Gin Asn lie Leu Val Xaa Leu Lys Pro Val Glu Lys Cys Trp Gin 
50 55 60 


(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Phe Leu Pro Asp Pro Ala Xaa Glu Gly Phe Asp Glu Gin Val Lys 
15 10 15 

Glu Leu Arg Ala Arg Ala Lys Glu lie Pro Asp Asp Tyr Phe Val Val 
20 25 30 

Leu Val Gly Asp Met lie Thr Glu Glu Ala Leu Pro Thr Tyr Gin Thr 
35 40 45 

Met Leu Asn Thr Leu Asp Gly Val 
50 55 

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Glu Thr Gly Ala Ser Leu Thr Pro Trp Ala Val Trp Thr 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Leu Leu His Thr Tyr Leu Tyr Leu Ser Gly Arg Val 
15 10 

(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Met Arg Gin lie Gin Lys Thr lie Gin Tyr Leu lie 
15 10 

(2) INFORMATION FOR SEQ ID NO : 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Thr Glu Asn Ser Pro Tyr Leu Gly Phe lie Tyr Thr Ser Phe Gin Glu 
15 10 15 

Arg 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Asp Val Xaa Leu Ala Gin lie Xaa Gly Thr lie Ala Ser Asp Glu Lys 
15 10 15 

Arg His Glu Thr Ala Tyr Thr Lys lie Val Glu Lys Leu Phe Glu lie 
20 25 30 

Asp Pro Asp Gly Thr Val Leu Ala Phe Ala Asp Met Met Arg Lys Lys 
35 40 45 

lie Xaa Met Pro Ala His Leu Met Tyr 
50 55 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asp Asn Leu Phe 
1 
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(2) INFORMATION FOR SEQ ID NO : 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Xaa Xaa Phe Xaa Ala Val Xaa Gin Arg Leu Xaa Val Tyr Thr Ala Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Tyr Ala Asp lie Leu Glu Phe Leu Val Gly Arg Trp Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 


Val Ala Asp Leu Thr Gly Leu Ser Gly Glu Gly Arg Lys Ala Xaa Asp 
15 10 15 

Tyr Val Cys Gly Leu Pro Pro Arg lie Arg Arg Leu Glu Glu Arg Ala 
20 25 30 

Gin Gly Arg Ala Lys Glu Gly Pro Val Val Pro Phe Ser Trp lie Phe 
35 40 45 

Asp Arg Gin Val Lys Leu 
50 


(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCTCACTTGT GTGGTGGAGG AGAAAAACAG AACTCACAAA AAGCTTTGCG ACTGCCAAGA 60 

ACAACAACAA CAACAAGATC AAGAAGAAGA AGAAGAAGAT CAAAAATGGC TCTTCGAATC 120 

ACTCCAGTGA CCTTGCAATC GGAGAGATAT CGTTCGTTTT CGTTTCCTAA GAAGGCTAAT 180 

CTCAGATCTC CCAAATTCGC CATGGCCTCC ACCCTCGGAT CATCCACACC GAAGGTTGAC 240 

AATGCCAAGA AGCCTTTTCA ACCTCCACGA GAGGTTCATG TTCAGGTGAC GCACTCCATG 300 

CCACCACAGA AGATAGAGAT TTTCAAATCC ATCGAGGGTT GGGCTGAGCA GAACATATTG 3 60 

GTTCACCTAA AGCCAGTGGA GAAATGTTGG CAAGCACAGG ATTTCTTGCC GGACCCTGCA 420 

TCTGAAGGAT TTGATGAACA AGTCAAGGAA CTAAGGGCAA GAGCAAAGGA GATTCCTGAT 480 

GATTACTTTG TTGTTTTGGT TGGAGATATG ATTACAGAGG AAGCCCTACC TACTTACCAA 540 

ACAATGCTTA ATACCCTAGA TGGTGTACGT GATGAGACTG GGGCTAGCCT TACGCCTTGG 600 

GCTGTCTGGA CTAGGGCTTG GACAGCTGAA GAGAACAGGC ATGGCGATCT TCTCCACACC 660 


7 



TATCTCTACC TTTCTGGGCG GGTAGACATG AGGCAGATAC AGAAGACAAT TCAGTATCTC 720 

ATTGGGTCAG GAATGGATCC TCGTACCGAA AACAGCCCCT ACCTTGGGTT CATCTACACA 780 

TCGTTTCAAG AGCGTGCCAC ATTTGTTTCT CACGGAAACA CCGCCAGGCA TGCAAAGGAT 840 

CATGGGGACG TGAAACTGGC GCAAATTTGT GGTACAATCG CGTCTGACGA AAAGCGTCAC 900 

GAGACCGCTT ATACAAAGAT AGTCGAAAAG CTATTCGAGA TCGATCCTGA TGGCACCGTT 960 

CTTGCTTTTG CCGACATGAT GAGGAAAAAG ATCTCGATGC CCGCACACTT GATGTACGAT 1020 

GGGCGTGATG ACAACCTCTT CGAACATTTC TCGGCGGTTG CCCAAAGACT CGGCGTCTAC 1080 

ACCGCCAAAG ACTACGCCGA CATACTGGAA TTTCTGGTCG GGCGGTGGAA AGTGGCGGAT 1140 

TTGACCGGCC TATCTGGTGA AGGGCGTAAA GCGCAAGATT ATGTTTGCGG GTTGCCACCA 1200 

AGAATCAGAA GGCTGGAGGA GAGAGCTCAA GGGCGAGCAA AGGAAGGACC TGTTGTTCCA 1260 

TTCAGCTGGA TTTTCGATAG ACAGGTGAAG CTGTGAAGAA AAAAAAAACG AGCAGTGAGT 1320 

TCGGTTTCTG TTGGCTTATT GGGTAGAGGT TAAAACCTAT TTTAGATGTC TGTTTCGTGT 1380 

AATGTGGTTT TTTTTCTTCT AATCTTGAAT CTGGTATTGT GTCGTTGAGT TCGCGTGTGT 1440 

GTAAACTTGT GTGGCTGTGG ACATATTATA GAACTCGTTA TGCCAATTTT GATGACGGTG 1500 

GTTATCGTCT CCCCTGGTGT TTTTTTATTG TTT 1533 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 6 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ala Leu Arg lie Thr Pro Val Thr Leu Gin Ser Glu Arg Tyr Arg 
-30 -25 -20 

Ser Phe Ser Phe Pro Lys Lys Ala Asn Leu Arg Ser Pro Lys Phe Ala 
-15 -10 -5 
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Met Ala Ser Thr Leu Gly Ser Ser Thr Pro Lys Val Asp Asn Ala Lys 
15 10 15 

Lys Pro Phe Gin Pro Pro Arg Glu Val His Val Gin Val Thr His Ser 

20 25 30 

Met Pro Pro Gin Lys lie Glu lie Phe Lys Ser lie Glu Gly Trp Ala 
35 40 45 

Glu Gin Asn lie Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 
50 55 60 

Ala Gin Asp Phe Leu Pro Asp Pro Ala Ser Glu Gly Phe Asp Glu Gin 
65 70 75 

Val Lys Glu Leu Arg Ala Arg Ala Lys Glu lie Pro Asp Asp Tyr Phe 
80 85 90 95 

Val Val Leu Val Gly Asp Met lie Thr Glu Glu Ala Leu Pro Thr Tyr 

100 105 110 

Gin Thr Met Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 
115 120 125 

Ser Leu Thr Pro Trp Ala Val Trp Thr Arg Ala Trp Thr Ala Glu Glu 
130 135 140 

Asn Arg His Gly Asp Leu Leu His Thr Tyr Leu Tyr Leu Ser Gly Arg 
145 150 155 

Val Asp Met Arg Gin lie Gin Lys Thr lie Gin Tyr Leu lie Gly Ser 
160 165 170 175 

Gly Met Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe lie Tyr 

180 185 190 

Thr Ser Phe Gin Glu Arg Ala Thr Phe Val Ser His Gly Asn Thr Ala 
195 200 205 

Arg His Ala Lys Asp His Gly Asp Val Lys Leu Ala Gin lie Cys Gly 
210 215 220 

Thr lie Ala Ser Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys lie 
225 230 235 

Val Glu Lys Leu Phe Glu lie Asp Pro Asp Gly Thr Val Leu Ala Phe 
240 245 250 255 

Ala Asp Met Met Arg Lys Lys lie Ser Met Pro Ala His Leu Met Tyr 

260 265 270 

Asp Gly Arg Asp Asp Asn Leu Phe Glu His Phe Ser Ala Val Ala Gin 
275 280 285 
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Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu Glu Phe 
290 295 300 

Leu Val Gly Arg Trp Lys Val Ala Asp Leu Thr Gly Leu Ser Gly Glu 
305 310 315 

Gly Arg Lys Ala Gin Asp Tyr Val Cys Gly Leu Pro Pro Arg lie Arg 
320 325 330 335 

Arg Leu Glu Glu Arg Ala Gin Gly Arg Ala Lys Glu Gly Pro Val Val 

340 345 350 

Pro Phe Ser Trp lie Phe Asp Arg Gin Val Lys Leu 
355 360 


(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 5 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AAAAGAAAAA GGTAAGAAAA AAAACA ATG GCT CTC AAG CTC AAT CCT TTC CTT 53 

MET Ala Leu Lys Leu Asn Pro Phe Leu 
1 5 

TCT CAA ACC CAA AAG TTA CCT TCT TTC GCT CTT CCA CCA ATG GCC AGT 101 
Ser Gin Thr Gin Lys Leu Pro Ser Phe Ala Leu Pro Pro MET Ala Ser 
10 15 20 25 

ACC AGA TCT CCT AAG TTC TAC ATG GCC TCT ACC CTC AAG TCT GGT TCT 149 
Thr Arg Ser Pro Lys Phe Tyr MET Ala Ser Thr Leu Lys Ser Gly Ser 

30 35 40 

AAG GAA GTT GAG AAT CTC AAG AAG CCT TTC ATG CCT CCT CGG GAG GTA 197 
Lys Glu Val Glu Asn Leu Lys Lys Pro Phe MET Pro Pro Arg Glu Val 
45 50 55 

CAT GTT CAG GTT ACC CAT TCT ATT GCC A 225 
His Val Gin Val Thr His Ser He Ala 
60 65 
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(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AAAAGAAAAA GGTAAGAAAA AAAACAATGG CTCTCAAGCT CAATCCTTTC CTTTCTCAAA 60 

CCCAAAAGTT . ACCTTCTTTC GCTCTTCCAC CAATGGCCAG TACCAGATCT CCTAAGTTCT 120 

ACATGGCCTC TACCCTCAAG TCTGGTTCTA AGGAAGTTGA GAATCTCAAG AAGCCTTTCA 180 

TGCCTCCTCG GGAGGTACAT GTTCAGGTTA CCCATTCTAT GCCACCCCAA AAGATTGAGA 240 

TCTTTAAATC CCTAGACAAT TGGGCTGAGG AGAACATTCT GGTTCATCTG AAGCCAGTTG 300 

AG AAATGTT G GCAACCGCAG GATTTTTTGC CAGATCCCGC CTCTGATGGA TTTGATGAGC 3 60 

AAGTCAGGGA ACTCAGGGAG AGAGCAAAGG AGATTCCTGA TGATTATTTT GTTGTTTTGG 420 

TTGGAGACAT GATAACGGAA GAAGCCCTTC CCACTTATCA AACAATGCTG AATACCTTGG 480 

ATGGAGTTCG GGATGAAACA GGTGCAAGTC CTACTTCTTG GGCAATTTGG ACAAGGGCAT 540 

GGACTGCGGA AGAGAATAGA CATGGTGACC TCCTCAATAA GTATCTCTAC CTATCTGGAC 600 

GAGTGGACAT GAGGCAAATT GAGAAGACAA TTCAATATTT GATTGGTTCA GGAATGGATC 660 

CACGGACAGA AAACAGTCCA TACCTTGGGT TCATCTATAC ATCATTCCAG GAAAGGGCAA 720 

CCTTCATTTC TCATGGGAAC ACTGCCCGAC AAGCCAAAGA GCATGGAGAC ATAAAGTTGG 7 80 

CTCAAATATG TGGTACAATT GCTGCAGATG AGAAGCGCCA TGAGACAGCC TACACAAAGA 840 

TAGTGGAAAA ACTCTTTGAG ATTGATCCTG ATGGAACTGT TTTGGCTTTT GCTGATATGA 900 

TGAGAAAGAA AATTTCTATG CCTGCACACT TGATGTATGA TGGCCGAGAT GATAATCTTT 960 

TTGACCACTT TTCAGCTGTT GCGCAGCGTC TTGGAGTCTA CACAGCAAAG GATTATGCAG 1020 

ATATATTGGA GTTCTTGGTG GGCAGATGGA AGGTGGATAA ACTAACGGGC CTTTCAGCTG 1080 

AGGGACAAAA GGCTCAGGAC TATGTTTGTC GGTTACCTCC AAGAATTAGA AGGCTGGAAG 1140 
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AGAGAGCTCA AGGAAGGGCA AAGGAAGCAC CCACCATGCC TTTCAGCTGG ATTTTCGATA 1200 

GGCAAGTGAA GCTGTAGGTG GCTAAAGTGC AGGACGAAAC CGAAATGGTT AGTTTCACTC 12 60 

TTTTTCATGC CCATCCCTGC AGAATCAGAA GTAGAGGTAG AATTTTGTAG TTGCTTTTTT 1320 

ATTACAAGTC CAGTTTAGTT TAAGGTCTGT GGAAGGGAGT TAGTTGAGGA GTGAATTTAG 1380 

TAAGTTGTAG ATACAGTTGT TTCTTGTGTT GTCATGAGTA TGCTGATAGA GAGCAGCTGT 1440 

AGTTTTGTTG TTGTGTTCTT TTATATGGTC TCTTGTATGA GTTTCTTTTC TTTCCTTTTC 1500 

TTCTTTCCTT TCCTCTCTCT CTCTCTCTCT CTCTCTCTTT TTCTCTTATC CCAAGTGTCT 1560 

CAAGTATAAT AAGCAAACGA TCCATGTGGC AATTTTGATG ATGGTGATCA GTCTCACAAC 1620 

TTGATCTTTT GTCTTCTATT GGAAACACAG CCTGCTTGTT TGAAAAAA 1668 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

MET Ala Leu Lys Leu Asn Pro Phe Leu Ser Gin Thr Gin Lys Leu Pro 
15 10 15 

Ser Phe Ala Leu Pro Pro MET Ala Ser Thr Arg Ser Pro Lys Phe Tyr 
20 25 30 

MET Ala Ser Thr Leu Lys Ser Gly Ser Lys Glu Val Glu Asn Leu Lys 
35 40 45 

Lys Pro Phe MET Pro Pro Arg Glu Val His Val Gin Val Thr His Ser 
50 55 60 

MET Pro Pro Gin Lys lie Glu lie Phe Lys Ser Leu Asp Asn Trp Ala 
65 70 75 80 

Glu Glu Asn lie Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 

85 90 95 
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Pro Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Asp Glu Gin 
100 105 110 

Val Arg Glu Leu Arg Glu Arg Ala Lys Glu lie Pro Asp Asp Tyr Phe 
115 120 125 

Val Val Leu Val Gly Asp MET lie Thr Glu Glu Ala Leu Pro Thr Tyr 
130 135 140 

Gin Thr MET Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 
145 150 155 160 

Ser Pro Thr Ser Trp Ala lie Trp Thr Arg Ala Trp Thr Ala Glu Glu 

165 170 175 

Asn Arg His Gly Asp Leu Leu Asn Lys Tyr Leu Tyr Leu Ser Gly Arg 
180 185 190 

Val Asp MET Arg Gin lie Glu Lys Thr lie Gin Tyr Leu lie Gly Ser 
195 200 205 

Gly MET Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe lie Tyr 
210 215 220 

Thr Ser Phe Gin Glu Arg Ala Thr Phe lie Ser His Gly Asn Thr Ala 
225 230 235 240 

Arg Gin Ala Lys Glu His Gly Asp lie Lys Leu Ala Gin lie Cys Gly 

245 250 255 

Thr lie Ala Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys lie 
260 265 270 

Val Glu Lys Leu Phe Glu lie Asp Pro Asp Gly Thr Val Leu Ala Phe 
275 280 285 

Ala Asp MET MET Arg Lys Lys lie Ser MET Pro Ala His Leu MET Tyr 
290 295 300 

Asp Gly Arg Asp Asp Asn Leu Phe Asp His Phe Ser Ala Val Ala Gin 
305 310 315 320 

Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu Glu Phe 

325 330 335 

Leu Val Gly Arg Trp Lys Val Asp Lys Leu Thr Gly Leu Ser Ala Glu 
340 345 350 

Gly Gin Lys Ala Gin Asp Tyr Val Cys Arg Leu Pro Pro Arg lie Arg 
355 360 365 

Arg Leu Glu Glu Arg Ala Gin Gly Arg Ala Lys Glu Ala Pro Thr MET 
370 375 380 
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Pro Phe Ser Trp lie Phe Asp Arg Gin Val Lys Leu 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TGAGAGATAG TGTGAGAGCA TTAGCCTTAG AGAGAGAGAG AGAGAGCTTG TGTCTGAAAG 60 

AATCCACAA ATG GCA TTG AAG CTT AAC CCT TTG GCA TCT CAG CCT TAC AAC 111 
MET Ala Leu Lys Leu Asn Pro Leu Ala Ser Gin Pro Tyr Asn 
15 10 

TTC CCT 117 

Phe Pro 

15 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ACT TCA TGG GCT ATT TGG ACA AGA GCT TGG ACT GCA GAA GAG AAC CGA 48 
Thr Ser Trp Ala lie Trp Thr Arg Ala Trp Thr Ala Glu Glu Asn Arg 
15 10 15 




CAC GGT GAT CTT CTC AAT AAG TAT CTT TAC TTG TCT GGA CGT GTT GAC 9 6 
His Gly Asp Leu Leu Asn Lys Tyr Leu Tyr Leu Ser Gly Arg Val Asp 
20 25 30 

ATG AGG CAG ATT GAA AAG ACC ATT CAG TAC TTG ATT GGT TCT GGA ATG 144 
MET Arg Gin lie Glu Lys Thr lie Gin Tyr Leu lie Gly Ser Gly MET 
35 40 45 

GAT CCT AGA ACA GAG AAC AAT CCT TAC CTC GG 17 6 

Asp Pro Arg Thr Glu Asn Asn Pro Tyr Leu 
50 55 

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1495 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA to mRNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 


TGAGAGATAG 

TGTGAGAGCA 

TTAGCCTTAG 

AGAGAGAGAG 

AGAGAGCTTG 

TGTCTGAAAG 

60 

AATCCACAAA 

TGGCATTGAA 

GCTTAACCCT 

TTGGCATCTC 

AGCCTTACAA 

CTTCCCTTCC 

120 

TCGGCTCGTC 

CGCCAATCTC 

TACTTTCAGA 

TCTCCCAAGT 

TCCTCTGCCT 

CGCTTCTTCT 

180 

TCTCCCGCTC 

TCAGCTCCAA 

GGAGGTTGAG 

AGTTTGAAGA 

AGCCATTCAC 

ACCACCTAAG 

240 

GAAGTGCACG 

TTCAAGTCCT 

GCATTCCATG 

CCACCCCAGA 

AGATCGAGAT 

CTTCAAATCC 

300 

ATGGAAGACT 

GGGCCGAGCA 

GAACCTTCTA 

ACTCAGCTCA 

AAGACGTGGA 

GAAGTCGTGG 

360 

CAGCCCCAGG 

ACTTCTTACC 

CGACCCTGCA 

TCCGATGGGT 

TCGAAGATCA 

GGTTAGAGAG 

420 

CTAAGAGAGA 

GGGCAAGAGA 

GCTCCCTGAT 

GATTACTTCG 

TTGTTCTGGT 

GGGAGACATG 

480 

ATCACGGAAG 

AGGCGCTTCC 

GACCTATCAA 

ACCATGTTGA 

ACACTTTGGA 

TGGAGTGAGG 

540 

GATGAAACTG 

GCGCTAGCCC 

CACTTCATGG 

GCTATTTGGA 

CAAGAGCTTG 

GACTGCAGAA 

600 

GAGAACCGAC 

ACGGTGATCT 

TCTCAATAAG 

TATCTTTACT 

TGTCTGGACG 

TGTTGACATG 

660 

AGGCAGATTG 

AAAAGACCAT 

TCAGTACTTG 

ATTGGTTCTG 

GAATGGATCC 

TAGAACAGAG 

720 
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AACAATCCTT 

ACCTCGGCTT 

CATCTACACT 

TCATTCCAAG 

CACGGAAACA 

CAGCTCGCCA 

AGCCAAAGAG 

CACGGAGACC 

GGCACAATAG 

CTGCAGACGA 

GAAGCGTCAT 

GAGACAGCTT 

CTCTTTGAGA 

TTGATCCTGA 

TGGTACTGTG 

ATGGCGTTTG 

ATCTCGATGC 

CTGCTCACTT 

GATGTACGAT 

GGGCGGGATG 

TCTTCTGTTG 

CTCAGAGGCT 

CGGTGTTTAC 

ACTGCCAAAG 

TTTTTGGTTG 

GGAGGTGGAA 

GATTGAGAGC 

TTGACCGGGC 

GCGCAAGAGT 

ACTTGTGTGG 

GTTGACTCCA 

AGAATCAGGA 

GCAAGAGCCA 

AGAAAGGACC 

CAAGGTTCCT 

TTCAGCTGGA 

CTCTAAAAAG 

GAACAAAGCT 

ATGAAACCTT 

TTCACTCTCC 

TCTGCTCTTG 

AAATTGGTGT 

AGATTACTAT 

GGTTTGTGAT 

CAAAGTTGAG 

AAGCAGTGAT 

TTAGTAGCTT 

TGTTGTTTCC 

TTTGGTCCTT 

TTAGTAAACT 

TGTTGTAGTT 

AAATCAGTTG 


(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 398 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

MET Ala Leu Lys Leu Asn Pro Leu Ala Ser Gin 
1 5 10 

Ser Ser Ala Arg Pro Pro lie Ser Thr Phe Arg 
20 25 

Cys Leu Ala Ser Ser Ser Pro Ala Leu Ser Ser 
35 40 

Leu Lys Lys Pro Phe Thr Pro Pro Lys Glu Val 
50 55 


AAAGAGCCAC 

CTTCATCTCT 

780 

TCAAGCTAGC 

CCAAATCTGC 

840 

ACACCAAGAT 

AGTTGAGAAG 

900 

CAGACATGAT 

GAGGAAGAAA 

960 

AAAGCCTCTT 

TGACAACTTC 

1020 

ACTATGCGGA 

CATTCTTGAG 

1080 

TTTCAGGTGA 

AGGAAACAAA 

1140 

GGTTGGATGA 

GAGAGCTCAA 

1200 

TACATGACAG 

AGAAGTGCAG 

1260 

GTCGTCCCTC 

ATTTGATCTA 

1320 

ATTGTTCGTG 

GGTCTAGTTA 

1380 

AGTCTTTAAA 

TGTTTTTGTG 

1440 

AACTGTTTGG 

TCTGT 

1495 


Pro Tyr Asn Phe Pro 
15 

Ser Pro Lys Phe Leu 
30 

Lys Glu Val Glu Ser 
45 

His Val Gin Val Leu 
60 
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His Ser MET Pro Pro Gin Lys lie Glu lie Phe Lys Ser MET Glu Asp 
65 70 75 80 

Trp Ala Glu Gin Asn Leu Leu Thr Gin Leu Lys Asp Val Glu Lys Ser 

85 90 95 

Trp Gin Pro Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Glu 
100 105 110 

Asp Gin Val Arg Glu Leu Arg Glu Arg Ala Arg Glu Leu Pro Asp Asp 
115 120 125 

Tyr Phe Val Val Leu Val Gly Asp MET lie Thr Glu Glu Ala Leu Pro 
130 135 140 

Thr Tyr Gin Thr MET Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr 
145 150 155 160 

Gly Ala Ser Pro Thr Ser Trp Ala lie Trp Thr Arg Ala Trp Thr Ala 

165 170 175 

Glu Glu Asn Arg His Gly Asp Leu Leu Asn Lys Tyr Leu Tyr Leu Ser 
180 185 190 

Gly Arg Val Asp MET Arg Gin lie Glu Lys Thr lie Gin Tyr Leu lie 
195 200 205 

Gly Ser Gly MET Asp Pro Arg Thr Glu Asn Asn Pro Tyr Leu Gly Phe 
210 215 220 

lie Tyr Thr Ser Phe Gin Glu Arg Ala Thr Phe lie Ser His Gly Asn 
225 230 235 240 

Thr Ala Arg Gin Ala Lys Glu His Gly Asp Leu Lys Leu Ala Gin lie 

245 250 255 

Cys Gly Thr lie Ala Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr 
260 265 270 

Lys lie Val Glu Lys Leu Phe Glu lie Asp Pro Asp Gly Thr Val Met 
275 280 285 

Ala Phe Ala Asp MET MET Arg Lys Lys lie Ser Met Pro Ala His Leu 
290 295 300 

Met Tyr Asp Gly Arg Asp Glu Ser Leu Phe Asp Asn Phe Ser Ser Val 
305 310 315 320 

Ala Gin Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu 

325 330 335 

Glu Phe Leu Val Gly Arg Trp Lys lie Glu Ser Leu Thr Gly Leu Ser 
340 345 350 
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Gly Glu Gly Asn Lys Ala Gin Glu Tyr Leu Cys Gly Leu Thr Pro Arg 
355 360 365 

lie Arg Arg Leu Asp Glu Arg Ala Gin Ala Arg Ala Lys Lys Gly Pro 
370 375 380 

Lys Val Pro Phe Ser Trp lie His Asp Arg Glu Val Gin Leu 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCTAAGCTTA ARGARATHCC AGAYGAYTA 29 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCTAAGCTTA ARGARATHCC GGAYGAYTA 29 
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(2) INFORMATION FOR SEQ ID NO: 23: 


(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 


29 base pairs 


(B) TYPE: 


nucleic acid 


(C) STRANDEDNESS: 


single 


(D) TOPOLOGY: 


linear 


(ii) MOLECULE TYPE: other nucleic acid 


(A) DESCRIPTION: synthetic oligonucleotide mixture 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GCTAAGCTTA ARGARATHCC CGAYGAYTA 29 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 


(ii) MOLECULE TYPE: other nucleic acid 


(A) DESCRIPTION: synthetic oligonucleotide mixture 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCTAAGCTTA ARGARATHCC TGAYGAYTA 29 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 


(C) STRANDEDNESS: 


single 


(D) TOPOLOGY: 


linear 


(C) STRANDEDNESS: 


single 


(D) TOPOLOGY: 


linear 


19 

m 


(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AGCGAATTCG TRTTNAGCAT NGTYTG 26 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AGCGAATTCG TRTTYAACAT NGTYTG 26 

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TCTAGAATTC TCTAATTACG TCTGTTTGTT CTATTTTTTA TATGATATCA AATATTCGTC 60 

ATAAATATAT GGTTTAAGAT GCCAAAAAAT TATTTACTTG GTGAATATAA TACGTTAAAT 120 

ATTAGAAATA CATCATTTAG TTAAATAAAT AACCAAAAAC CAAAAATTCA TATCCGCGCT 180 

GGCGCGCGGT CAGGGTCTCG TTAGTTTTAA AATCAATGCA GTTTACAATT AATTTCCAGC 240 

20 


TGAAAATAAG TATAATTTGT ATTGAAATTA TAAAGTGACA TTTTTTGTGT AACAAATATT 300 

TTGTGTAACA AGAATTAAAA AAAAAAACAG AAAATACTCA GCTTTTTTAA TAATAAAAAA 3 60 

AATTAATTGA GTTAGAAAAT TGTTGTACCA AT AAC AAAAG ATTTATATGG AATTATAAAA 420 

TCAACACACC AATAACACAA GACTTTTTAA AAATTTAAGA ATAATATAAG CAATAACAAT. 480 

AGAATCTTCA AATTCTTCAA ATCCTTAAAA ATCAATCTCC CACTATTAAT CCCCCTTAGT 540 

TTTAGTTGGT AATGGCAACG TTTGTTGACT ACCGTATTGT AACTTTTGTC AAATTGTCAT 600 

AAATACGTGT CAAACTCTGG TAAAAAATTA GTCTGCTACA TCTGTCTTTT ATTTATAAAA 660 

CACAGCTGTT AATCAGAATT TGGTTTATTA AATCAACAAC CTGCACGAAA CTTGTGTGAG 720 

CATATTTTGT CTGTTTCTGG TTCATGACCT TCTTCCGCAT GATGGCCAAG TGTAATGGCC 780 

ACTTGCAAGA GCGTTTCTTC AACGAGATAA GTCGAACAAA TATTTGTCCG TTACGACCAC 840 

ATATAAAATC TCCCCATCTC TATATATAAT ACCAGCATTC ACCATCATGA ATACCTCAAA 900 

TCCCAATCTC ACAAATACTT CAATAAAAAG ACCAAAAAAA ATTAAAGCAA AGAAAAGCCT 9 60 

TCTTGTGCAC AAAAAAAAAA GAAGCCTTCT AGGTTTTCAC GACATGAAGT TCACTACTCT 1020 

AATGGTCATC ACATTGGTGA TAATCGCCAT CTCGTCTCCT GTTCCAATTA GAGCAACCAC 1080 

GGTTGAAAGT TTCGGAGAAG TGGCACAATC GTGTGTTGTG ACAGAACTCG CCCCATGCTT 1140 

ACCAGCAATG ACCACGGCAG GAGACCCGAC TACAGAATGC TGCGACAAAC TGGTAGAGCA 1200 

GAAACCATGT CTTTGTGGTT ATATTCGAAA CCCAGCCTAT AGTATGTATG TTACTTCTCC 12 60 

AAACGGTCGC AAAGTCTTAG ATTTTTGTAA GGTTCCTTTT CCTAGTTGTT AAATCTCTCA 1320 

AGACATTGCT AAGAAAAATA TTATTAAAAA TAAAAGAATC AAACTAGATC TGATGTAACA 13 80 

ATGAATCATC ATGTTATGGT TGAAGCTTAT ATGCTGAAGT GTTTGATTTT ATATATGTGT 1440 

GTGTGTGTGT CCTGCTCAAT TTTTGAAACA CACACGTTTC TCCTGATTTG GATTTAAATT 1500 

ATATTTTGAG TTAAAAAAAA GAAAAAGATG GAATGCTATT TATACAAGTT GATGAAAAAG 1560 

TGGAAGTACA ATTTAGATAT CTCCTACACT TAAAGAATGA AACAATAATA GACTTACGAA 1620 

ACAAATGAAA AATACATAAA TTGTCGACAA TCAACGTCCG ATGACGAGTT TATTATTAAA 1680 

AATTTGTGTG AAGGACTAGC AGTTCAACCA AATGATATTG AAC AT AT AC A TCAACAAATA 17 40 

TGATAATCAT AAAAGAGAGA ATGGGGGGGG GGTGTCGTTT ACCAGAAACC TCTTTTTCTC 1800 

AGCTCGCTAA AACCCTACCA CTAGAGACCT AGCTCTGACC GTCGGCTCAT CGGTGCCGGA 1860 
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GGTGTAACCT TTCTTTCCCA TGACCCGAAA CCTCTCTTTC CCAACTCACG AAAACCCTAC 1920 

AATCAAAAAC CTAGCTCCGA CCGTCGGCTC ATCGGTGCCG AAGGTGTAAC CTTTCTCTCC 1980 

CATCATAGTT TCTCGTAAAT GAAAGCTAAT TGGGCAATCG ATTTTTTAAT GTTTAAACCA 2040 

TGCCAAGCCA TTTCTTATAG GACAATTGTC AATAATAGCA TCTTTTGAGT TTTGTCTCAA 2100 

AAGTGACACT AGAAGAAAAA AGTCACAAAA ATGACATTCA TTAAAAAGTA AAATATCCCT 2160 

AATACCTTTG GTTTAAATTA AATAAGTAAA CAAAAATAAA TAAAAACAAA TAAAATAAAA 2220 

ATAAAAAATG AAAAAAAGAA ATTTTTTTAT AGTTTCAGAT TATATGTTTT CAGATTCGAA 2280 

ATTTTTTAAA TTCCCTTTTT TAAATTTTCT TTTTTGAAAT TTTTTTTTTT GAAATTTTTT 2340 

GAAACTGTTT TTAAAATTTT TATTTTTAAT TTTTTAGTAT TTATTTTTTA TTTTATAAAA 2 400 

TTTTAAACGC TAATTCCAAA ACTCCCCCCC CCCCCCCCCC CCCAATTCTC TCCTAGTCTT 2 460 

TTTCTCTTTC TTATATTTGG GCTTCTATCT TCTCTTTTTT TTTCAGGCCC AAAGTATCAT 2 520 

GTGTAACAAC CGGTGTTCAA AAACGCGCCC GCCTGGCCGT TTACTCGCCC GATTAAATGA 2 580 

TGATCGGAAG GCTGCCATGG CGAGGCGGAG GTAATCAGTG GTTCTAGGCG CTGAAACTAG 2 640 

AAAACCTTCA AAAATCGAAA TTTTAAGAGC TAAATCGGTG TTTATCTCAT GAATCTATTA 2700 

TATTTAGTTG AAACTCACAA GAATCGGTTG T AAAAAC TAT GAAATCGTGC AAAAAAAATG 2760 

AAGAACAAAA TATTCTCAGA TCTGGAAAAC ACAGAGAAGA GGTTGAAGAT GAGGGTAAAA 2820 

TCGTATTTTG TCATTCATTA AACTAAAATC AAAAAAAAAT GATGCAAAAT TCAATGATAA 2880 

TAACTCGAAC TCGCAACCAT ATGCATCTTT AGACTGCGAC ACGGACCACT AGACTAAGCA 2 940 

ATTTTAATGT TTATTCATCA CAGACCTAAT ATATGTCTAA AACTAGGCGC CGAGTACGCC 3 000 

CCGCTTAATC CCGAGTTTTT GTTAGCTCGC TAGACCCAGG GTCACCGCCC GACTAACGAG 3 060 

TAGCGTAATT CTGAACTGGG GTAACAACAT AGAGAACATC GCCGACCCTT CCCTGCCGAT 3120 

GATGCCGCCT CCGATGAACT TCCTGTAACG CCTTCAGTTT CCATTGATTT TCCCCTTTAA 3180 

TCTGATCAGT TCCATGTTTT ATCCAACTCA TCCCACTCCG TAGCATTTAA TCGATCTCAT 3 240 

CATTTACATA CATAACCAGT AGGAGGTCTC ATATAAATTT GAACGTTTCC AGCGATGAAC 3 300 

AGTGCCAATC TCTGCGAAAT CCATTTCTCT AAGCTCAGGG CTGGCGGCTG CAGCCCGGGG 33 60 

ATCCACTAGT TCTAGGCGGC CGCACCGCGG TGGAGCTCCA ATTCGCCCTA TAGTGAGTCG 3 420 

TATTACGCGC GCTCACTGGC 3 440 
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(2) INFORMATION FOR SEQ ID NO: 28: 


(i) SEQUENCE CHARACTERISTICS: 


(A) LENGTH: 


3898 base pairs 


(B) TYPE: 


nucleic acid 


(C) STRANDEDNESS: 


double 


(D) TOPOLOGY: 


linear 


(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CTCGAGAGCT GAAGGATTTT TTGTTAGAGA TTCAACGACA GATGGACCCT TCCTCCACTA 60 

GGCAACTGCA AGAACCTAAC AATGCAAATA TCACTCCTCC TCAGCCTTCA AGGAGCGTTA 120 

ATAGGACTGG AACAAGCGGT CAAGTGAGTA AATTTTCCTT CCAAGATAGA TCTCTATGGT 180 

TCGGTTCATG AAGTTTGTGG TTTAATTGTG TAGCAACAGG ATAGTGCAAG TGAGAATAGA 240 

GTTCGACCTC ATCTACCTAC CCCGGAACCT CTGAATGTAT CCCCATTGAA GAAGAAGAGG 300 

GCAAATCCTG CACCCAGAAG GATAAAGAAA TTTTGGACGC CTGAAGAAGT GGCAGTTCTG 3 60 

AGGGAAGGAG TAAAAGAGTA TGTCTACTAC TACTACTCTA TAATCAAGTT TCAAGAAGCT 420 

GAGCTTGGCT CTCACTTTAT ATGTTTGATG TTGTTGTGCA GGTATGGTAA ATCATGGAAA 480 

GAGATAAAGA ATGCAAACCC TGAAGTATTG GCAGAGAGGA CTGAGGTGAG AGAGCATGTC 540 

ACTTTTGTGT TACTCATCTG AATTATCTTA TATGCGAATT GTAAGTGGTA CTAAAAGGTT 600 

TGTAACTTTT GGTAGGTGGA TTTGAAGGAT AAATGGAGGA ACTTGCTTCG GTAGCGGTAA 660 

CAAGTTTTAT ATTGCTATGA AGTTTTTTTG CCTGCGTGAC GTATCAGCAG CTGTGGAGAA 720 

GATGGTATTA GAAAGGGTCT TTTCACATTT TGTGTTGTGA CAAATATTAA TTCGGCCGGT 780 

ATGGTTTGGT TAAGACTTGT TGAGAGACGT GTGGGGTTTT TTGATGTATA ATTAGTCTGT 840 

GTTTAGAACG AAACAAGACT TGTTGCGTAT GCTTTTTTTA ACTTGAGGGG GTTTGTTGTT 900 

GTTAGTTAGG AACTTGACTT TGTCTCTTTC TCTCAAGATC TGATTGGTAA GGTCTGGGTG 960 

GTAGTACTGT TTGGTTTAAT TTGTTTTGAC TATTGAGTCA CTGTGGCCCA TTGACTTTAA 1020 

ATTAGGCTGG TATATTTTTT GGTTTAAAAC CGGTCTGAGA TAGTGCAATT TCGATTCAGT 1080 


\3% 


CAATTTTAAA TTCTTCAAGG TAATGGGCTG AATACTTGTA TAGTTTTAAG ACTTAACAGG 1140 

CCTTAAAAGG CCCATGTTAT CATAAAACGT CATTGTTTAG AGTGCACCAA GCTTATAAAA 1200 

TGTAGCCAGG CCTTAAAAGA CTTAACAGGC CTTAAAAGAC TTAACATTCC TTAAAAGGCC 1260 

CATGTTATCA TAAAACGTCA TCGTTTTGAG TGCACCAAGC TAAATGTAGC CAGGCCTTAA 1320 

AAGACTTAAC AGGCCTTAAA AGGCCCATGT TATCATAAAA CGCCGTCGTT TTGAGTGCAC 1380 

CAAGCTTATA AATGTAGCCA GCTACCTCGG GACATCACGC TCTTTGTACA CTCCGCCATC 1440 

TCTCTCTCTC TCGAGCAGAT CTCTCTCGGG AATATCGACA ATGTCGACCA CTTTCTGCTC 1500 

TTCCGTCTCC ATGCAAGCCA CTTCTCTGGT AATCTCATCT CCTTCTTGTG TTCCCAGATC 1560 

GCTCTGATCA TACTTTCTTT TAGATCATTT GCCTCTGATC TGTTGCTTGA TGTTTGTTAA 1620 

CTCTCCACGC ATGTTTGATT ATGTTGAGAA TTAGAAAAAA AATGTTAGCT TTACGAATCT 1680 

TTAGTGATCA TTTCAATTGG ATTTGCAATC TTGTGTGACA TTTGAGGCTT GTGTAGATTT 1740 

CGATCTGTAT TCATTTTGAA TCACAGCTAT AATAGTCATT TGAGTAGTAG TGTTTTTAAA 1800 

TGAACATGTT TTGTTGTATT GATGGAACAA ACAGGCAGCA ACAACGAGGA TTAGTTTCCA 1860 

GAAGCCAGCT TTGGTTTCAA CGACTAATCT CTCCTTCAAC CTCCGCCGTT CAATCCCCAC 1920 

TCGTTTCTCA ATCTCCTGCG CGGTATGTTC TCATTCTCAG CATTTATTTC GAGCTTGCTT 1980 

GTCATGGTAC TCTCTCTAAT TGTCTATTTG GTTTATTAGG CCAAACCAGA GACGGTTGAG 2040 

AAAGTGTCTA AGATAGTTAA GAAGCAGCTA TCACTCAAAG ACGACCAAAA GGTCGTTGCG 2100 

GAGACCAAGT TTGCTGATCT TGGAGCAGAT TCTCTCGACA CTGTAAGTCA TCAATCATTC 2160 

TCTTATGTGA ATAAAGAGAA CTTGAAGAGT TTGTTTTTAA CATATTAACT GAGTGTTTTG 2220 

CATGCAGGTT GAGATAGTGA TGGGTTTAGA GGAAGAGTTT GATATCGAAA TGGCTGAAGA 2280 

GAAAGCTCAG AAGATTGCTA CTGTGGAGGA AGCTGCTGAA CTCATTGAAG AGCTCGTTCA 2340 

ACTTAAGAAG TAATTTTAGT ATTAAGAGCA GCCAAGGCTT TGTTGGGTTT GTTGTTTTCA 2400 

TAATCTTCCT GTCATTTTCT TTTTCTTTAA TGTGTCAAGC GACTCTGTTG GTTTAAAGTA 2460 

GTATCTGTTT GCCATGGATC TCTCTCTATT TGTCGACTGA AAACTTTTGG TTTACACATG 2520 

AAAGCTTGTT CTTGTTCTTT CTTAAATCGA AATGCCAAAT GCGAGATTAG GGAATCTTGT 2580 

ATTAACACAT ACATAAGTCA AAGAGTAGGC CCTAAGATGA CAATTTATAA ACAATCCTAT 2 640 

TCACATTGTA TATACAGGTT ATGATTATTC CCAATCAGCG TCAAAGAATC CAGCATCTTT 2700 
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CATCTCTGAA TAGTAGACAT TCTCCAAGTT CACATCTTCC TCCTGCACCA AAAACCAGTA 27 60 

CTAAATCATG AACATTGCAA TAATCACATG CCTAGGCGAG AGTTTTGGTG ATGTGGTGTT 2820 

AGTGATAGTG ATACTGATGG TGCTAGAGCG GTTAAGAAGG ATTAACCTGG AAGAAGTCTG 2 880 

CAAGGAAAGT AACATAGAGA AGAGGAAGAT AGGAGTGGTA ACAAACACTT GTGATCCCAT 2940 

ACAGCCTCCC AGCATTTTTC AAATGTTATT TCCTTACATA AAGAAACAAG AGAAGTCTGA 3 000 

CTAGATGATA TTTATATAGG ATAAGTGTTT TACCATAAGC CAAAGTGAGC GCCGTTTGCA 3 060 

AGAGCTAACC AGACAGTACA CGTTTGGCAT ATATCTCATC AACATGATCT GAAAAGTAAC 3120 

ATATCACAGT TAATGAACAC AATGGTTACC TTGAGAAGCA AATCAAGACC TATAACAAGC 3180 

CCAGAGATGA GGAAAGTCCG TGTCAACGCT TCACCGCCAT TCGCGTAGTT TCCTTGGAAG 3240 

ACAAAGGCCA CCAACCAAAC TTACTTCCAG AAACAACACT CCAAATGTTG TCAACAAAGT 3300 

CAATAGATTC CAAACTACTT CGTTACAGGG TTGTATAGAT AATATAATAG AATAGTGGGA 33 60 

AGATAGTATA AATAAAATAA ATAAAAGATC CTATCGGTAA ATAGTTTATA ATATCGGGGG 3 420 

CGTATATAAA GTATAAAAGA AACTCTTCTC CAATCCGACC GTTGAAAATC ACTCTCAATC 3 480 

TCTGGCGTAA CGACCGGATC GTTCGCGCGT AATTTTCGCT GCTATAAATA GAAACTTTCC 3 540 

TCTTCTGTTT CTCGATCAAA ATTTTTTTTT GGAAAAATTA AGTTTGAATC TATCGTAGAT 3 600 

GCTGTGACAA AAAAAAATTG TTTTATCGAA GATGAGAAAC ATGAGGCCTG TTCATGCAAG 3 660 

GAACCAGACC ACGGATCCAT CTTCGCCGAT GATGACGTCT CCTCTGATGA ATCGTCACGC 3720 

ACGGACAGGA TCCAACGCTG GACCAGCATC TAACGCCAAG AAAGCACAGA CGAAAGCAGC 3780 

AGCTCAGAGA CTCGCGGCTG TGATGTCGAA CCAAACAGGC GACGATGAAG ACAGTGATGA 3 840 

TGACCTTTCC TTTGACTACA ACGCTGTCGG AAGCATTGGT CTCGCTGCCG GAAGATCT 3 898 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4325 base pairs 

(B) TYPE: nucleic acid 


(C) STRANDEDNESS: 


double 


(D) TOPOLOGY: 


linear 
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(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

CTCGAGGCAG TCACTAACAT GAAGTTTGAC GAGGAGCCCA ACTATGGGAA GCTTATTTCT 60 

CTTTTCGATA CTCTAATTGA GCCGTGCGCT CTATCTAGAC CAATTAGAAT TGATGGAGCT 120 

CTAAAGGTTG CTGGCTGTTT TCTTGTTCAT ATGATTAACT TCTAAACTTG TGTATAAATA 180 

TTCTCTGAAA GTGCTTCTTT TGGCATATGT AGGTTGGGCA AAAACGAGGA AGATTGCTTC 240 

TCAATTTGGA AGAGGATGAA CAGCCGAAGA AGAAAATAAG AATAGGCAGT CCTGCTACTC 300 

AATGGATCTC AGTCTATAAC GGTCGTCGTC CCATGAAACA GAGGTAAAAC ATTTTTTGCA 3 60 

TATACACTTT GAAAGTTCCT CACTAACTGT GTAATCTTTT GGTAGATATC ACTACAATGT 420 

CGGAGAGACA ANGGCTGSNC ANCATATACA AAAGGGAAAT GAAGATGGCC TTTTGATTAG 480 

CTGTGTAGCA TCAGCAGCTA ATCTCTGGGC TCTCATCATG GATGCTGGAA CTGGATTCAC 540 

TTCTCAAGTT TATGAGTTGT CACCGGTCTT CCTACACAAG GTAATAATCA GTTGAAGCAA 600 

TTAAGAATCA ATTTGATTTG TAGTAAACTA AGAAGAACTT ACCTTATGTT TTCCCCGCAG 660 

GACTGGATTA TGGAACAATG GGAAAAGAAC TACTATATAA GCTCCATAGC TGGTTCAGAT 720 

AACGGGAGCT CTTTAGTTGT TATGTCAAAA GGTTAGTGTT TAGTGAATAA TAAACTTATA 780 

CCACAAAGTC TTCATTGACT TATTTATATA CTTGTTGTGA ATTGCTAGGA ACTACTTATT 840 

CTCAGCAGTC ATACAAAGTG AGTGACTCAT TTCCGTTCAA GTGGATAAAT AAGAAATGGA 900 

AAGAAGATTT TCATGTAACC TCCATGACAA CTGCTGGTAA TCGTTGGGGT GTGGTAATGT 960 

CGAGGAACTC TGGCTTCTCT GATCAGGTAG GTTTTTGTCT CTTATTGTCT GGTGTTTTTA 1020 

TTTTCCCCTG ATAGTCTAAT ATGATAAACT CTGCGTTGTG AAAGGTGGTG GAGCTTGACT 1080 

TTTTGTACCC AAGCGATGGG ATACATAGGA GGTGGGAGAA TGGGTATAGA ATAACATCAA 1140 

TGGCAGCAAC TGCGGATCAA GCAGCTTTCA TATTAAGCAT ACCAAAGCGT AAGATGGTGG 1200 

ATGAAACTCA AGAGACTCTC CGCACCACCG CCTTTCCAAG TACTCATGTC AAGGTTGGTT 12 60 

TCTTTAGCTT TGAACACAGA TTTGGATCTT TTTGTTTTGT TTCCATATAC TTAGGACCTG 1320 

AGAGCTTTTG GTTGATTTTT TTTTCAGGAC AAATGGGCGA AGAATCTGTA CATTGCATCA 1380 

ATATGCTATG GCAGGACAGT GTGCTGATAC ACACTTAAGC ATCATGTGGA AAGCCAAAGA 1440 

CAATTGGAGC GAGACTCAGG GTCGTCATAA TACCAATCAA AGACGTAAAA CCAGACGCAA 1500 
t 
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CCTCTTTGGT TGAATGTAAT GAAAGGGATG TGTCTTGGTA TGTATGTACG AATAACAAAA 1560 
GAGAAGATGG AATTAGTAGT AGAAATATTT GGGAGCTTTT TAAGCCCTTC AAGTGTGCTT 1620 
TTTATCTTAT TGATATCATC CATTTGCGTT GTTTAATGCG TCTCTAGATA TGTTCCTATA 1680 
TCTTTCTCAG TGTCTGATAA GTGAAATGTG AGAAAACCAT ACCAAACCAA AATATTCAAA 17 40 
TCTTATTTTT AATAATGTTG AATCACTCGG AGTTGCCACC TTCTGTGCCA ATTGTGCTGA 1800 
ATCTATCACA CTAGAAAAAA ACATTTCTTC AAGGTAATGA CTTGTGGACT ATGTTCTGAA 1860 
TTCTCATTAA GTTTTTATTT TCTGAAGTTT AAGTTTTTAC CTTCTGTTTT GAAATATATC 1920 
GTTCATAAGA TGTCACGCCA GGACATGAGC TACACATCGC ACATAGCATG CAGATCAGGA 1980 
CGATTTGTCA CTCACTTCAA ACACCTAAGA GCTTCTCTCT CACAGCGCAC ACACATATGC 2040 
ATGCAATATT TACACGTGAT CGCCATGCAA ATCTCCATTC TCACCTATAA ATTAGAGCCT 2100 
CGGCTTCACT CTTTACTCAA ACCAAAACTC ATCACTACAG AACATACACA AATGGCGAAC 2160 
AAGCTCTTCC TCGTCTCGGC AACTCTCGCC TTGTTCTTCC TTCTCACCAA TGCCTCCGTC 2220 
TACAGGACGG TTGTGGAAGT CGACGAAGAT GATGCCACAA ATCCAGCCGG CCCATTTAGG 2280 
ATTCCAAAAT GTAGGAAGGA GTTTCAGCAA GCACAACACC TGAAAGCTTG CCAACAATGG 2340 
CTCCACAAGC AGGCAATGCA GTCCG.GTAGT GGTCCAAGCT GGACCCTCGA TGGTGAGTTT 2400 
GATTTTGAAG ACGACGTGGA GAACCAACAA CAGGGCCCGC AGCAGAGGCC ACCGCTGCTC 2460 
CAGCAGTGCT GCAACGAGCT CCACCAGGAA GAGCCACTTT GCGTTTGCCC AACCTTGAAA 2520 
GGAGCATCCA AAGCCGTTAA ACAACAGATT CGACAACAAC AGGGACAACA AATGCAGGGA 2580 
CAGCAGATGC AGCAAGTGAT TAGCCGTATC TACCAGACCG CTACGCACTT ACCTAGAGCT 2640 
TGCAACATCA GGCAAGTTAG CATTTGCCCC TTCCAGAAGA CCATGCCTGG GCCCGGCTTC 2700 
TACTAGATTC CAAACGAATA TCCTCGAGAG TGTGTATACC ACGGTGATAT GAGTGTGGTT 27 60 
GTTGATGTAT GTTAACACTA CATAGTCATG GTGTGTGTTC CATAAATAAT GTACTAATGT 2 820 
AATAAGAACT ACTCCGTAGA CGGTAATAAA AGAGAAGTTT TTTTTTTTAC TCTTGCTACT 2880 
TTCCTATAAA GTGATGATTA ACAACAGATA CACCAAAAAG AAAAC AATT A ATCTATATTC 2940 
ACAATGAAGC AGTACTAGTC TATTGAACAT GTCAGATTTT CTTTTTCTAA ATGTCTAATT 3 000 
AAGCCJTCAA GGCTAGTGAT GATAAAAGAT CATCCAATGG GATCCAACAA AGACTCAAAT 3060 
CTGGTTTTGA TCAGATACTT CAAAACTATT TTTGTATTCA TTAAATTATG CAAGTGTTCT 3120 
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TTTATTTGGT GAAGACTCTT TAGAAGCAAA GAACGACAAG CAGTAATAAA AAAAACAAAG 3180 
TTCAGTTTTA AGATTTGTTA TTGACTTATT GTCATTTGAA AAATATAGTA TGATATTAAT 3240 
ATAGTTTTAT TTATATAATG CTTGTCTATT CAAGATTTGA GAACATTAAT ATGATACTGT 3300 
CCACATATCC AATATATTAA GTTTCATTTC TGTTCAAACA TATGATAAGA TGGTCAAATG 33 60 
ATTATGAGTT TTGTTATTTA CCTGAAGAAA AGATAAGTGA GCTTCGAGTT TCTGAAGGGT 3 420 
ACGTGATCTT CATTTCTTGG CTAAAAGCGA ATATGACATC ACCTAGAGAA AGCCGATAAT 3 480 
AGTAAACTCT GTTCTTGGTT TTTGGTTTAA TCAAACCGAA CCGGTAGCTG AGTGTCAAGT 3 540 
CAGCAAACAT CGCAAACCAT ATGTCAATTC GTTAGATTCC CGGTTTAAGT TGTAAACCGG 3 600 
TATTTCATTT GGTGAAAACC CTAGAAGCCA GCCANCCTTT TTAATCTAAT TTTTGCAAAC 3 660 
GAGAAGTCAC CACACCTCTC CACTAAAACC CTGAACCTTA CTGAGAGAAG CAGAGNCANN 3720 
AAAGAACAAA TAAAACCCGA AG AT G AG AC C ACCACGTGCG GCGGGACGTT CAGGGGACGG 3780 
GGAGGAAGAG AATGRCGGCG GNSNTTTGGT GGCGGCGGCG GACGTTTTGG TGGCGGCGGT 3 840 
GGACGTTTTG GTGGCGGCGG TGGACCTTTG GTGGTGGATA TCGTGACGAA GGACCTCCCA 3 900 
GTGAAGTCAT TGGTTCGTTT ACTCTTTTCT TAGTCGAATC TTATTCTTGC TCTGCTCGTT 3 960 
GTTTTACCGA TAAAGCTTAA GACTTTATTG ATAAAGTTCT CAGCTTTGAA TGTGAATGAA 4020 
CTGTTTCCTG CTTATTAGTG TTCCTTTGTT TTGAGTTGAA TCACTGTCTT AGCACTTTTG 4080 
TTAGATTCAT CTTTGTGTTT AAGTTAAAAG GTAGAAACTT TGTGACTTGT CTCCGTTATG 4140 
ACAAGGTTAA CTTTGTTGGT TATAACAGAA GTTGCGACCT TTCTCCATGC TTGTGAGGGT 4200 
GATGCTGTGG ACCAAGCTCT CTCAGGCGAA GATCCCTTAC TTCAATGCCC CAATCTACTT 4260 
GGAAAACAAG ACACAGATTG GGAAAGTTGA TGAGATCCAA GCTTGGGCTG CAGGTCGACG 4320 
AATTC 4325 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 


(C) STRANDEDNESS: 


single 


(D) TOPOLOGY: 


linear 


IM3 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGGATCCACT GCAGTCTAGA GGGCCCGGGA 30 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AATTTCCCGG GCCCTCTAGA CTGCAGTGGA TCCGAGCT 3 8 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTAAGTAGGT AGGGCTTCCT CTGTAATCAT ATCTCCAACC AAAACAACAA 50 
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(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTAAGAAGT AACCCGGGCT GCAGTTTTAG TATTAAGAG 39 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGAATTCGTC GACAGATCTC TGCAGCTCGA GGGATCCAAG CTT 43 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCATTTTTGA TCTTCCTCGA GCCCGGGCTG CAGTTCTTCT TCTTCTTG 48 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCTCGTTTTT TTTTTCTCTG CAGCCCGGGC TCGAGTCACA GCTTCACC 48 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE' TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ACTGACTGCA GCCCGGGCTC GAGGAAGATC AAAAATGGCT CTTC 44 
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(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAGTAGTGAA CTTCATGGAT CCTCGAGGTC TTGAAAACCT AGA 43 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CAATGTCTTG AGAGATCCCG GGATCCTTAA CAACTAGGAA AAGG 44 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTAAGACACG ACTTATCGCC ACTG 24 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGAATTCGTC GACAGATCTC TGCAGCTCGA GGGATCCAAG CTT 43 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GCTTGTTCGC CATGGATATC TTCTGTATGT TC 32 
(2) INFORMATION FOR SEQ ID NO: 43: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA to mRNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

GAT GCC AAA ANG CCT CAC ATG CCT CCT AGA GAA GCT CAT GTG CAA AAG 48 
Asp Ala Lys Xaa Pro His MET Pro Pro Arg Glu Ala His Val Gin Lys 
15 10 15 

ACC CAT TCA ATK CCG CCT CAA AAG ATT GAG ATT TTC AAA TCC TTG GAG 9 6 
Thr His Ser Xaa Pro Pro Gin Lys lie Glu lie Phe Lys Ser Leu Glu 
20 25 30 

GGT TGG GCT GAG GAG AAT GTC TTG GTG CAT CTT AAA CCT GTG GAG AA 143 
Gly Trp Ala Glu Glu Asn Val Leu Val His Leu Lys Pro Val Glu 
35 40 45 
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